Combining Database and Information Extraction Techniques to Discover Structure From Partially Structured Data
Abstract
This paper shows how Information Extraction and Semantic Web Ontology technologies can be combined with information integration techniques in the AutoMed framework to extend the facilities provided by databases for handling free text data. It gives a design for a demonstrator system, ESTEST (Experimental Software to Extract Structure from Text), which has several novel features. In particular, database schema information is combined with language and domain ontologies. This enhanced metadata is then used to drive information extraction tasks that discover new structure, which in turn is used to extend the structured data and its schema.
Similar resources
Combining data integration and information extraction
Improving the ability of computer systems to process text is a significant research challenge. Many applications are based on partially structured databases, where structured data conforming to a schema is combined with free text. Information is stored as text in these applications because the queries required...
Identification of Fraud in Banking Data and Financial Institutions Using Classification Algorithms
In recent years, due to the expansion of financial institutions, as well as the popularity of the World Wide Web and e-commerce, a significant increase in the volume of financial transactions has been observed. In addition to the increase in turnover, a huge increase in fraud arising from abnormal user behavior is resulting in billions of dollars in losses around the world. T...
The ESTEST System - Combining Data Integration and Information Extraction
We describe an approach which combines techniques from Data Integration and Information Extraction in order to make better use of the unstructured data found in applications built over databases containing both structured data and text. We contrast this approach to similar work and then give details of the implementation of our ESTEST system. ESTEST integrates available data sources into a glob...
Combining information extraction and data integration in the ESTEST system
We describe an approach which builds on techniques from Data Integration and Information Extraction in order to make better use of the unstructured data found in application domains such as the Semantic Web which require the integration of information from structured data sources, ontologies and text. We describe the design and implementation of the ESTEST system which integrates available stru...
Publication date: 2003